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FIGURE 9 

1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 
51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 
201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT 
251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 
301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 
3 5 1 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 
401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
451 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 
55 1 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 
601 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
65 1 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
75 1 GAATGAGAAA GATCACTTTC TCAAC AACAT CAACGTGCCG AATTGGAATA 
801 ATATGAAATC AAGAACCAG A ATATTTTATT GCACTC ATTT TAATAGAAAT 
85 1 AACCAATTCT TGAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
95 1 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 
1 00 1 CTTGAGAAAG TCAAAGATTT TAACTTC AAC TACTATTTAA CAAAATCTTG 
1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 
1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 
1151 TACACAACTG ATAATAAATG C GTC AC AC A A TTTATTAATG AATTTTTCTA 
1201 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 
1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 
1301 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 
1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 
1 40 1 ACGTCTTATG G AAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 
145 1 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 
1501 AACCTATTAC TAC AGAAAG A ATATTTGGGA CGTCATTATG A A A ATGTC A A 
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCC AAGA AAAAGAGGTT 
1 60 1 GAAGAATGGA AAAAGTCGCT TGGATTTGC A CCTGGAAAAC TCAGACTAAT 
1 65 1 ACCGAAGAAA ACTACTTTCC GTCC AATTAT G ACTTTC AAT AAGAAGATTG 
1 70 1 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 
1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
1851 AGTTTGTTTG C AAATGGAAG C AAGTTGGAC AACC AAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1 95 1 AACATTCCTA AAAACTACTA AATT ACTTTC TTC AGATTTC TGGATTATGA 
2001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
2051 TTT AGAAAG A AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 
2151 AACAAAATGA CTTAAATGC A AAGAAAACAT TAATTGTTGA AGCAAAGCAA 
2201 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 
225 1 CC AATAT A AT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 
2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 
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2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 
2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 
2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
255 1 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 
265 1 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 
2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
275 1 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
2851 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 
2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 
2951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 
3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
305 1 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
3101 AAGTACATTT TCAACAGAGT TTGC ATGATC CTCAAGGCAA AAGAAGCAAA 
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TC AATATGAT GCATAGTCGA 
3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TC A ATTTTTG TCTTATATAC 
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 
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1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 
51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 
101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 
151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 
35 1 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 
451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
55 1 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDIEKCYDS VNREKLSTFL KTTKJLLSSDF WIMTAQILKR KNNIVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
75 1 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
85 1 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 
1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 
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1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 
51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 
101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 
151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 
201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 
251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
35 1 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 
501 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
75 1 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 
801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGG 
901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAG A AG ACT AAAG AAAGAG ATTTCAAAAT 
1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
105 1 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 
1 101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 
1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1 25 1 TCAC ATTC AT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA 
1301 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 
1351 AAGAAATGG A GCCG A AATCT TAATCAAAAA G AATTGCGTC GATATTGC AA 
1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 
1601 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGG GG 
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CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 

1 + + + 4- + + 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

PQNPKTPKPL * KKKKLR'* F R 
PKTPKPQNPYKKRKNCGSLE- 
PKPQNPKTPIKKEKI E V V * K- 
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TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 
1 ♦ + + ___ + + 18( 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

YF L I HST S IAA LVVTR KDAK - 
TS*YIQQV--*QLL* *QERMQN- 
L P NT F NKYS S S C S DK KQ C K T - 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

1 «. » + «. + + 24C 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

H C N LARN RL HC LF Q SC K NN 
IEIWL EIAFIDYSKVAKTIR- 
LKSGSKSPSLTI PKLQ KQ LK- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 
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TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 
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TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 
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ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

CGLFYFLDHFLRS IMEKIT* 
E D Y S I F * ITS * G A L W R K L L N - 
RIILFFRS LLKEHYGENYLJ- 

TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 
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ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 
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LKGKQFGLFP 'PTMMSILNS- 
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CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 
GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 
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AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 
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TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

KTQEKVC'SNSRRTYCIYYS 
KRKKK .FDNRTAEEL-IAFTIR- 
NARKS L I I EQQKNLLHLLFV- 

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 
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ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

YGFYYNCFRYRRCTPESCDN 
MGFITIVLG I DGELP S LETI- 
WVLLQLF*VSTVN SRVLRQL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 
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ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 
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TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 
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ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 
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CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 
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GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 
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GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 
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GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 
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CCTTAATTGTTCTTCTTATAATCCTTCCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 
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TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

1081 ♦ + + + - * 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

LKISK IPGKRDTFIKIHIL* 

* RFQKFQVREIHSLKFIYYS- 
KDFKNSR*ERYIH*NSYIIV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 ♦ + * + ♦ 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

FFISQLLFSFILTIFFD'LE 
F SFHSCYFLLS * Q Y F L I S W K - 
FHFTAVI F FY LNN I FC LAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 + + * + ♦ * 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

VKSIK*EKR*TEVT*LIHIH 

* KVSNKRSARLR * LS LFTFI 
KKY QIREALDCG NLAYSHS *- 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 ♦ + + + + * 1320 

TCTAGCTGGAAGTATATAGGTTATGGTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

RSTFIYPIR C'G N SSHPF'K 
DRPSYIQYDDKETAVIRFKN- 
IDLHISNTMIRKQQSSV LKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

1321 + + * + * * 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

♦ CYED'IFRV KKWSRNLNQK 
SAMRTKFL ES RNG AE I L I KK 
VLCGLNF * S QEMEPKS * SKR- 

GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 - + + «- <■ 1440 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

ELRRYCKR I E L I F R * ' V L P 

NCVOIAKESNSKS FVNKYY Q 
IASILQKNRTLNLSL IS I T N - 

ATCTTCATTCATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAACAAATAAA 

1441 - * --* * * 1500 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

ILIDCRD'RCNCTEDH'RNK 
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LDCLKRLTRQ LH R R S L K K - S - 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

1501 * * * * ♦ 1560 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

VTFIN'RIN'ITNIEISDLQ 

t,LLIRE*TKLLI*RSAIFN- 
NFY'LENKLNY - YRDQRSSI- 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

1561 * * 1620 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

LTK'XLN' S'TIKNTNLG'QN 
CRNKSCT KVRQ* KIQTL.VKI 
DE IKAE LKLDM K K Y K P W S K Y - 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

1621 -- * -- -- 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCCTTATTTATTTTACT 

IEEGXEDQLAKEK.IRQ* IKC 
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CGRKRRPVSKRKMKA INKMS- 
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GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 

1681 * + + -' + + 1"?40 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

VQK.CRNKRF I FFNNLLKRGV 
YRSEEIKDLFFSI IYCKEGF- 
TEVKK'KIYFFQ ' FIEK RGF- 

TTGGGGTTTTGGGGTTTTGGGG 

1741 ♦ -- 1762 

AACCCCAAAACCCCAAAACCCC 
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1 aactcattta attactaatt taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
6 1 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
241 atgtttgttg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcactt cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
421 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatttgcata ggttctctat atttttgatg caactgaatt 
54 1 caaaaatttg tatcttgata ggatactttc ataagatatt cgtaaggaac tcactttccg 
601 taagtgttta caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
661 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
721 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
841 agatgcaatc aaggctttaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtactt 
961 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1021 gatccttggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaactt gcaatcaaga tagcagttaa caagaattta gatgaaatca- aaggacacac 
1 62 1 tgcaatcttc tctgatgttt ctggttctat gagtacctca atgtcaggtg gagccaagaa 
1681 gtatggttcc gttcgtactt gtctcgagtg tgcattagtc cttggtttga tggtaaaata 
1741 acgttgtgaa aagtcctcat tctacatctt cagttcacct agttctcaat gcaataagtg 
1801 ttacttagaa gttgatctcc ctggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacttggtg gtggtactga tttcccctat gagtgcattg atgaatggac 
1921 aaagaataaa actcacgtag acaatatcgt tattttgtct gatatgatga ttgcagaagg 
1 98 1 atattcagat atcaatgtta gaggcagttc cattgttaac agcatcaaaa agtacaagga 
2041 tgaagtaaat cctaacatta aaatctttgc agttgactta gaaggttacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcttaaag ttcatttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
2221 ctttgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat 
2281 ctcaccccac ttttttgttt tattgcatag ccattatgaa atttaaatta ttatctattt 
2341 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atgattctgc 
2401 aaagaacaaa aaagattaaa a 
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1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
181 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
721 ttcacaaaca atcaaactta ctaataattc ttactagact.gttaacatag acgttaattt 
781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
84 1 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 
1021 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
1 62 1 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1 74 1 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 
1 86 1 gtatactttt aagtagaacg aattttaatt taataacgtt aaaagtgcaa aaattgaatc 
1 92 1 ttccFcatta gaaagcttag aagatattga tagtctttgc aaatctattg cttcttgtaa 
1981 aaatttacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 
204 1 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
222 1 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 
228 1 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
2341 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 
240 1 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
246 1 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
252 1 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 
264 1 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 
2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 
2761 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 
2821 aaaaaatcg 




FIGURE 22 



MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 
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FSVNLQATQEIYDSLHKLLIRSTNLKXFKLSYKYEMEKSmDTFIDLKJSflYETLNNLK 

RCSVNISNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFNNVKSAKIE 

SSSLESLEDIDSLCKSIASCKNLQNWIIASLLYPNMQKNPFNKPNLLFFKQFEQLK 

NLENVSmCILDQfflLNSISEFLEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPE 

LNQVYINQQLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFDQNTVSD 
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LCLPTGTYYDYNSDRW 
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MKJLFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 

PNSRKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKIARNEDVNNSLFCHS 
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WVQRSS S S S ATAAQIKQLTEP VTNKQFLHKLNINSS S FFP YSKILPSSSSIKKLTDLR 

EAIFPTNLVKJPQRLKVRINLTLQKLLKRHKRLNYVSILNSICPPLEGTVLDLSHLSR 

QSPKERVLKFIIVILQKLLPQEMFGSKK^KGKIIKNLNLLLSLPLNGYLPFDSLLKKL 

RLKDFRWLFISDIWFTKHNFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTI 

VYFRFIDTWNKLITPFIVEYFKTYLVENNVCRNHNSYTLSNFNHSKMRIIPKKSNNEFR 

IIAIPCRGADEEEFTIYKENHKNAIQPTQKILEYLRNKRPTSFTKIYSPTQIADRIKE 

FKQRLLKj^NNVLPELYFMKFDVKSCYDSIPRM^^ 
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NFfflRSKSSKGIFRSLIALFNTRISYKTIDTNLNSTNTVLMQIDHVVKNISECYKSAF 

KDLSrNVTQNMQFHSFLQRIIEMTVSGCPITKCDPLIEYEVRFTILNGFLESLSSNTS 

KFKDNIILLRKEIQHLQAYIYIYIHIVN 
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Oxytricha LCVSYILSSFYYANLEENALQFLRXESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 
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Motif 0 

human " AXFLHWLKSVYVVELLRSFFYVTETTFQKHR 
tezl ISEIEVn.VLGKRSNAKMCLSDFBKRKfilFAEFIYVrLYKSFIIPILQSFFYlTBSSDLRHR 

BST2 LKDFRWLFISD IWFTKHHFENLNQLAICFISWLPRQLIPKI IO;TPFYCTEISSTVT- 

pl23 TRElSWMQVET-8AKHFYYFDHEN-IYVLWKLLRWIFEDLWSLIRCFFYVXEQQKSySK 

Motif 1 

human LFFYRKSVWSKL2SIGIRQHL,KRVQLRDVSEAEVRQHREARPALLTSRLRFIPKP— dql 

Q>Z 1 TVYFRKDIWKLLCRPFl-TSMKMEAFEKIHENNVRMDTQK-TXLPPAVlRliLPKK— »TF 

E]ST2 IVYFRHDTWNKLITPFIVEYFKTYLVEKKVCRNHNSYTLS--MFNBSKMRIIPKKSHMEF 
pji23 TYYYRKNIWDVIKKMSI-ADLKKEXLAEVQEKEVEEWKKS-LOFAPGKLRLIPKK— TTF 

if] ..*.*. * * . * . * * 

Motif 2 

human RPIVNMDYWGARTFRREKRAERLTSRVKALF-SVLNYERA 

tezl RX.ITN-LRKRFLIKMGSMKKMLVSTNQTLRPVASILKHLINEESSGIPFNLEVYMKLL.TF 
EST2 RI I A I PCRGADEEEFT I YKENHKMAI QPTQKI UEYLRHKRPTS FTK I YSPTQ I ADRI KEF 

pl23 RP I MTFNKK I VNS DRKTTKLTTHTKLLHS HLWLKTLKN- RMFKDPFGF AVFH Y DDVMKKY 

* * * * . . 

Motif 3 (A) 

Bbezl KKDLLKHRMFGR-KKYFVRIDIKSCYDRIKQDLMFRIVKK-KLKDPEFVIRKYATIHATS 
BST2 KQRLLKKFMMVLPELYFMKFDVKSCYDSIPRMECMRILKD-ALKNEKGFFVRSQYFFMTK 
=lpl23 EBFVCKWKQVGCPKLFFATMDIEKCYDSVNREKLSTFLKTTKLLSSDFWIMTAQILKRKM 
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ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGGCATCCCATGCAG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGT CAAAT CTTGCTATGATT C 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGGTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGT CTTTTT CAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTGACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 
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AKJFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 

VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKR 

AERLTSRVKALFSVLNYERA 
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GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 

TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 

GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 

GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 

AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 

CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 

GTGCTCAACTACGAGCGGGCGCG 
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MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTVVGF 

DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLV 

STFPNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRK 

RTIETSITQNKSARKEVSVmSISISRFSIFYRSSYKKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG 

LINAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKILS 

YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEIILKDLETFLKLSRYESFSLHYLMSNIKISEIEWLVL 

GKRSNAKMCLSDFEKJRKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCRPFIT 

SMKMEAFEKmENNVRMDTQKTTLPPAVIRLLPKKNTFRLITNLRKRFLIKMGSNKKMLVSTNQT 

LRPVASILKHLINEESSGIPFNLEVYMKLLTFKKX)LLKHRMFGRJKXYFVRIDIKSCYDRIKQDLMFR 

IVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 

WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLFITVNKKDAKKFLNLSLRGFEKFINFSTSLEKTVIN^ 

FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCNI 
YRLGYSMCMRAQAYLKJIMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 
LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRIAD 
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ggtaccgatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatatttttacaagaactcaataacaataccaagtcaaattccaatatgaagg 
tgttattagtgatcgataatamctatmatcggtcgttaccaagtataaggacaaaaagaacaacttccttccccctaaagacttttacmattaatttacttttcaaatatatttcg 
ggttcgcttacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatatagctcttggagtagctcacagaaatccttacaaatctt 
ctgatgagactatattagattcattacagtccgtgcatattcttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttg 
aaaaggttgataattatttgcaaaatcatgtccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatgattgagatattcaaaaatttctatccactacaa 
ctccmaacgcggtmattmctatmctattctcatgttgttccaaatatgtatcatctcgt^^ 
- aataatctaaattagtttcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaagatactttgcaaaacatttattagctatcattatataaaa 
aaaatcctataattataaatattaatcaatatttgcggtcactatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgcAT 
GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 
CCTTAAATGATTATGTACAACTTGTTTTGAGAGGGTCGCCGGCAAGCTCGTATAGCAATATATGCGAA 
CGCTTGAGAAGCGATGTACAAACGTCCTTTTCTATTTTTCTTCATTCGACTGTAGTCGGCTTCGACAGT 
AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATGCTCACAGTCAGAGgtatatatatttttgttttgatttttttctattcg 
ggatagctaatatatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 
ATCTACTGATGAAAGGGTTTTCCATGgtaaggtattctaattgtgaaatatttacctgcaattactgtttcaaagagattgtatttaaccgataaagAA 
TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTCGTTTCTACTTTTCCTAATTA 
CCTTATATCTATACTTGAGTCAAAAAATTGGCAACTTTTGTTAGAAATgtaaataccggttaagatgttgcgcactttgaaca 
agactgacaagtatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTATTTTTGAGGCTCTTC 
CAAATGACAATTACCTTCAGATTTCTGGCATACCACTTTTTAAAAATAATGTGTTTGAGGAAACTGTGT 
CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 
CTGGAATAGCATTTCAATTAGTAGGTTTAGCATTTTTTACAGGTCATCCTATAAGAAGTTTAAGCAAGgt 
aactaatactgttatccttcataactaattttagATCTATATTTTAACTTACACTCTATTTGTGATCGGAACACAGTACACATG 
TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATTGCACAA 
AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 
ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 
CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 
CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAAGCAAAGGATATTTGAGATAATATTAAAAGg 
tattgtataaaatttattaccactaacgattttaccagACCTCGAAACTTTCTTGAAATTATCGAGATACGAGTCTTTTAGTTTAC 
ATTATTTAATGAGTAACATAAAGgtaatatgccaaatttttttaccattaattaacaatcagATTTCAGAAATTGAATGGCTAGT 
CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGAAACGCAAGCAAATATTTGCGG 
AATTCATCTACTGGCTATACAATTCGTTTATAATACCTATTTTACAATCTTTTTTTTATATCACTGAATC 
AAGTGATTTACGAAATCGAACTGTTTATTTTAGAAAAGATATTTGGAAACTCTTGTGCCGACCCTTTAT 
TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtattttaaagtattttttgcaaaaagctaatattttcagAACAA 
TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 
CTTTCGTCTCATTACGAATTTAAGAAAAAGATTCTTAATAAAGgtattaatttttggtcatcaatgtactttacttctaatctattattag 
cagATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 
AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTTCTTACT 
TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattatataatgcgcgattcctcattattaattttgcagGCGTAAGAAG 
TATTTTGTACGGATAGATATAAAATCCTGTTATGATCGAATAAAGCAAGATTTGATGTTTCGGATTGTT 
AAAAAGAAACTCAAGGATCCCGAATTTGTAATTCGAAAGTATGCAACCATACATGCAACAAGTGACCG 
AGCTACAAAAAACTTTGTTAGTGAGGCGTTTTCCTATTgtaagmatttmcattggaatttmaacaaattctttmagTTGATAT 
GGTGCCTTTTGAAAAAGTCGTGCAGTTACTTTCTATGAAAACATCAGATACTTTGTTTGTTGATTTTGT 
GGATTATTGGACCAAAAGTTCTTCTGAAATTTTTAAAATGCTCAAGGAACATCTCTCTGGACACATTGT 
TAAGgtataccaattgttgaattgtaataacactaatgaaactagATAGGAAATTCTCAATACCTTCAAAAAGTTGGTATCCCTC 
AGGGCTCAATTCTGTCATCTTTTTTGTGTCATTTCTATATGGAAGATTTGATTGATGAATACCTATCGTT 
TACGAAAAAGAAAGGATCAGTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 
AGGATGCAAAAAAATTTTTGAATTTATCTTTAAGAGgtgagttgctgtcattcctaagttctaaccgttgaagGATTTGAGAA 
ACACAATTTTTCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 
ATACTTTTTTTAATGAAAGCAAGAAAAGAATGCCATTCTTCGGTTTCTCTGTGAACATGAGGTCTCTTG 
ATACATTGTTAGCATGTCCTAAAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTGACGAAAC 
ATATGGGGAAATCTTTTTTTTACAAAATTCTAAGgtatactgtgtaactgaataatagctgacaaataatcagATCGAGCCTTGC 
ATCCTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCAATTCTTGCTGCAATATATATAG 
GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 
AAAGAATGTTCATAACGGgtgagtacttattttaactagaaaagtcattaattaaccttagATCTTTTGAATGTTATTGGAAGAAAA 
ATTTGGAAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGTTTCTTGTCCTCTGCAGAAGTCAA 
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ATGgtacgtgtcggtctcgagacttcagcaatattgacacatcagGCTTTTTTGTCTTGGAATGAGAGATGGTTTGAAACCCTCTT 

TCAAATATCATCCATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 

TAAGACCAGTTTTGCGACAGGTGTTATTTTTACATAGAAGAATAGCTGATTAAtgtcattttcaatttattatatacatcctt 

tattactggtgtcttaaacaatattattactaagtatagctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagttttgattgacttgtcttt 

atccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaacattaaaagtaatacatgaggctaatctcctttcatttag 

aataaggaaagtggttttctataatgaataatgcccgcactaatgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatat 

acccagtgttgttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagccccaggttatccatggtggccg 

gccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcatttaatgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaata 

gtgttaagccattattggattccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcctcctgatttaaaggag 

gaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgcaaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcgga 

gagtatctccagcggatccttgatgtcaataacttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 
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EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



FFYCTEISST Vf&JVYFRHDT WN KLIT P FIVE YFK-TYLVEN 

FFYVTEQQKS Y13KTYYYRKN IWDVI-MKMS IAD- LKK ETLA--EVQE 

KHKE GSQIFYYRKP IWKLVSKLTI VKVRIQFSEK NKQMKNNFYQ 

FFY.TE..K. .1..YYYRK. IW...-KL.. -F..K V. . 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



NVCRSfKNSY- TLSNFNHS 

KEVEEWKKSL GFAPC 

KIQLEEENLE KVEEKLI PED SFQKYPQC 
K . . . E . . . .F. .C 



fhMiKipcrg 

pf( E i ■JTFNKK 
FB E I /TTFLRK 
EsfaiMTF.RK 



EST2 pep . 
Euplotes pep 
Trans of tetrahymen 
Consensus 



ADEE&FTIYK. ENHKNAIQP^> l ,QKILE^|bpC 
IVNSDRKTTK LTTNTKLLNllHLMLKI t 0*- 



DKQK1JIK UJt33QILMD^LVFW>; | 



RPTSFTKIYS PTQIADRIKE 

RMFK -DPFGFA^FN 

ML-G -QXIGYSVFp 

.... -..IO..VF'. 



129 
120 
130 
150 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



f NVL- 
; EFVCKWKQVG 
fe"A QFIEKWKNKG 



i VKSCYD 
?FATMD 7.EKCYD 



157 
155 
158 
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S - 1 • FFY VTE TTF QKN RLF FYR KSV WSK 
S-2: RC)H LKR VQL RDV SEA EVR QHREA 
S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 

A-l: A1CF LHW LMS VYV VEL LRS T ^ ^ TO Q - 
A-2- LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3i PAL LTS RLR FIP KPD GLR PIV NMD YW 




I918-SIM /m aOd-IJi 

i sia-svM M aod-ia 
f la-sw /m aod-ia 



aod-ia ,e 

8Dd |DU|6uo 




(a)9 (0)9 (,a)f (v)e zi mow in 



1J©SU| VNQ O|Uiou©.0 



IuoluBdjj hi push q>l Z~ 
peousnbes dq ggss 



ee amoM 
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Polv 4 

t t 
t a a g c c 
5 ' - cag acc aaa gga att 
Q T K G I 



4<B') 



5(C) 

D D Y L L I T 

ctg ctg atg gag gag tag tgg 
a a a a a a a a a 
t t t t 
c c 
Polv 1 
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Motif B' (4) 
QTKGIPQG 



Motif C (5) 
DDYLLIT 
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PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 



Ot LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRI.T 

Ea_pl23 KGIPQGLCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRI1TDDYLI1IT 

Sp_M2 SILSSFLCHFYMEDLIDEYLSFTKKK OSVLLRW 

Sc_pl03 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS QDTLILKLADDFLIIS 



QKVGIPQG 

caa aaa gtt ggt ate cct cag gg < Actual Genomic Sequence. 

Poly 4 

t t c 

ta agec teg 

cag acc aaa gga att cca taa gg > 

ag acc aaa gga att cca tea ggC TCA ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG 

'■■ tc tgg ttt cct taa ggt agt ccG AGT TAA GAC AGT AGA AAA AAC ACA GTA AAG ATA TAC 

K G I P S G S I L S S F L C H F Y M 



GAA GAT TTG ATT 
CTT CTA AAC TAA 

E D L I 



GAT GAA TAC CTA 
CTA CTT ATG GAT 

D E Y L 



TCG TTT ACG AAA 
AGC AAA TGC TTT 

S F T K 



AAG AAA GGA TCA 
TTC TTT CCT AGT 

K K G S 



GTG TTG TTA CGA 
CAC AAC AAT GCT 

V L L R 



GTA GTC gac gac 
CAT CAG ctg ctg 

V V D D 

< c tg ctg 

a a 



etc 


etc 


ate 


acc 


gag 


gag 


tag 


tgg 


L 


L 


I 


T 


gag 


gag 


tag 


tgg 


a a 


a a 


a 


a 


t 


t 


t 


t 


c 









Poly 1 



gac gat ttc etc ttt ata aca. ...... < Actual Genomic Sequence. 

D D F L F I T 
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3' RT PCR Strategy 



■ AAAAAAAAAAAAAAAA. . 



1 . Synthesis of cDNA with Qt Primer. 
mRNA 



■ AAAAAAAAAAAAAAAA . . 



• Qt 

■ QO 



2. First Round PCR Using Outside Primer and Qo Primer. 



3. Second Round PCR Using Inside Primer and ©| Primer. 



4. Sequence Second Round PCR Products Using Inside Primer or Q| Primer. 
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-Size Selected Libraries from P. Nurese 

3~4kb 

5~6kb 

7 ~ 8 kb 

11 ~ 12 kb 
-Libraries from J.A. Wise 

Sau 3a Partial Digest 

Hind III Partial Digest 

cDNA Libraries 

GAD (Gal Activation Domain) Library 
REP Library from R. Allshire 
REP81ES Library (old) 
REP81ES Library (new) 
oi REP41ES Library 



o < e 2 © a 

-£ P. Nurse z 2 O z 0 

o I — I q ~r & a a c 

o I CN 1 u g n Z2 = I < 

Q j t j — Ti tu • 

c « io rs - (!) a i 







Original PCR ■ 




3' RT-PCR HHH 




5 : 4 4- 






500 bp 
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A5 B2 




Hind III Digested Positive Genomic Clones 
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5' RTPCR Strategy 

mRNA 

5' ■HMMMMMMMMMMMMM 

ddA <j -p ^ '■ Mh" 



1. Synthesis of cDNA with Specific Downstream Primer. 

mRNA 



2. Ligate Oligo with 5'-P and blocked 3' to cDNA using T4 RNA Ligase. 

« — — ■ -■ *~~««^- 

3. First Round PCR 



■ AAAAAAAAAAAAAAAA . . 



4. Second Round PCR 



FIGURE 41 



Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif 0 

S.p. Tezlp (429) . WLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIW . . . (35) . . . 

S.c. Est2p (366). WLFRQLIPKIIQTFFYCTEISSTVT-IVYFRHDTW ...(35)... 

E.a. pl23 (441). WIFEDLWSLIRCFFYVTEQQKSYSKTYYYRKNIW ...(35)... 

Motif 1 Motif 2 K 
p hh h K hR h R 

S.p. Tezlp AVIRLLPKK — NTFRLITN-LRKRF ...(61)...' 
S.c. Est2p SKMRIIPKKSNNEFRIIAIPCRGAD ...(62)... 
E.a. pl23 GKLRLIPKK — TTFRPIMTFNKKIV ...(61)... 
* *** ** * 

Motif 3 (A) AF 

h hDh GY h 
S.p. Tezlp KKYFVRIDIKSCYDRIKQDLMFRIVK ...(89)... 
S.c. Est2p ELYFMKFDVKSCYDSIPRMECMRILK ...(75)... 
E.a. pl23 KLFFATMDIEKCYDSVNREKLSTFLK ...(107)... 
***** * 

Motif 4 (B' ) 

hPQG pP hh h 
S.p. Tezlp YLQKVGIPQGSILSSFLCHFYMEDLIDEYLSF . . . (6) . . . 
S.C. Est2p Y I REDGLFQGS SLS API VDLVYDDLL EF YS EF ...(8)... 
E.a. pl23 YKQTKGI PQGLCVS S ILS SFYYATLEESSLGF . . . (14) . . . 

Y Motif 5(C) Motif 6(D) 

h F DDhhh Gh h cK h 

S.p. Tezlp VLLRVVDDFLFITVNKKDAKKFLNLSLRGFEKHiNFSTSLEKTVINFENS .(205) 
S.c. Est2p LILKLADDFLIISTDQQQVINIKKLAMGGFQKYtJAKAInIRDKILAVSSQS .(173) 
E.a. pl23 LLMP.LTDDYLLITTQENNAVLFIEKLINVSRENGFKFMMKKLQTSFPLS .(209) 
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Disruption strategy for the putative telomerase genes. 



wild type telomerase gene 



wild type telomerase gene 



1. Transform with linear fragment " 
containing the telomerase gene 
disrupted with a LEU2 or ura4 marker. 
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(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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An Example of Confirmation of tezl disruption By PCR 
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Tezl disruption causes progressive 
shortening of telomeres in S. pombe 
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1 ggtaccgatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatatttttacaaga 80 
81 actcaataacaataccaagtcaaattccaatatgaaggtgttattagtgatcgataatatttctattttatcggtcgtta 160 
161 ccaagtataaggacaaaaagaacaacttccttccccctaaagacttttactttattaatttacttttcaaatatatttcg 240 
241 ggttcgcttacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattgg-atat 320 
321 agctcttggagtagctcacagaaatccttacaaatct.tctgatgagactatattagattcattacagtccgtgcatattc 400 
401 ttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttgaaaag 480 
481 gttgataattatttgcaaaatcatgt'ccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatg 560 
561 attgagatattcaaaaatttctatccactacaactcctttaacgcggttttatttttctattttctattctcatgttgtt 640 
641 ccaaatatgtatcatctcgtattaggcttttttccgttttactcctggaatcgtacctttttcactattccccctaatga 720 
721 ataatctaaattagtttcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaa 800 
801 gatactttgcaaaacatttattagctatcattatataaaaaaaatcctataattataaatattaatcaatatttgcggtc 880 
881 actatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018 
1M T E H H T P K S R I L R F L E N Q Y V 20 

1019 TAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG GCA AGC TCG 1078 
21 YLCTLNDY V QLVLRGSPA S S 40 

i§*79 TAT AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 1138 
41 YS NICERLRSDVQ TSFSIF L 60 

1139 CAT TCG ACT GTA GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 1198 
^61 HSTVVGFDSKPDEGVQFSS P 80 
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1199 AAA TGC TCA CAG TCA GAG gtatatatatttttgttttgatttttttctattcgggatagctaatatatgggcag 1272 
81 K C S Q S E 86 



1273 CTA ATA GCG AAT GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA 1332 

87 L I ANVVKQM F D E S F ERR R N L 106 

1333 CTG ATG AAA GGG TTT TCC ATG gtaaggtattctaattgtgaaatatttacctgcaattactgtttcaaagaga 1405 

107 L M K G F S M 113 



1406 ttgtatttaaccgataaag AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA AAC GGA GTA CAA AAT 1469 

114 NHEDFRAMHVNGVQN 128 

1470 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 

129 DLVSTFPNYLISI LESKNWQ 148 

1530 CTT TTG TTA GAA AT gtaaataccggttaagatgttgcgcactttgaacaagactgacaagtatag T ATC GGC 1601 

149 LLLEI I G 155 

1602 AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG GCT CTT CCA AAT GAC 1661 

156 SDAMHYLLSKGSI FEALPND 175 

1662 AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT GTG TTT GAG GAA ACT GTG 1721 

176 NYLQ ISGIPLFKNNVFEETV 195 

1722 TCA AAA AAA AGA AAG CGA ACC ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 1781 

196 SKKRKRTI ETS ITQN KSARK 215 

1782 GAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 

216 EVSWN SI SI SRFS IFYRSSY 235 

1842 AAG AAG TTT AAG CAA G gtaactaatactgttatccttcataactaattttag AT CTA TAT TTT AAC 
1907 

?3 6 K K F K Q D LYF N 245 

1908 TTA CAC TCT ATT TGT GAT CGG AAC ACA GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG 1967 

246 LH S ICDRNTVH MWLQWI F PR 265 

1968 CAA TTT GGA CTT ATA AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA 2027 

266 QFGLIN AFQVKQLHKVI PLV 285 

2028 TCA CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087 

286 SQSTVVPKRLLKVYPLI EQT 305 

2088 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 

306 AKRLH RI S LSKVYNHYC P Y I 325 



2148 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG 2207 
326 DTHDDEKILSYSLKPNQVFA 345 

2208 TTT CTT CGA TCC ATT CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA 22 67 
346 FLRSILVRVFPKL IWGNQRI 365 

2268 TTT GAG ATA ATA TTA AAA G gtattgtataaaatttattaccactaacgattttaccag AC CTC GAA ACT 2336 
366 FEIILKD LET 375 
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2337 TTC TTG AAA TTA TCG AGA TAC GAG TCT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 2396 

376 FLKLSRYESFSLHYLMSNIK 395 

2397 gtaatatgccaaatttttttaccattaattaacaatcag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 2465 

396 ISEIEW LVLG 405 

2466 AAA AGG TCA AAT GCG AAA ATG TGC : TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2 525 

406 KRSNAKMCLSDFEKRKQI FA 425 

2526 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2585 

426 EFIYWLYNSFIIPILQ SFFY 445 

2586 ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 2 645 

446 I T E S S D LRNRT VYFRKD IWK 465 

2646 CTC TTG TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA -ATA AAC GAG 2705 

466 L L CRPF IT SM KMEAF E K I N E 485 

2706 gtattttaaagtattttttgcaaaaagctaatattttcag AAC AAT GTT AGG ATG GAT ACT CAG AAA ACT 2775 

-486 NNVRMDTQKT 495 

■2776 ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT ACC TTT CGT CTC ATT ACG 2835 

496 T L P P A V I R L L P K K N T F R L I T 515 

2836 AAT TTA AGA AAA AGA TTC TTA ATA AAG gtattaatttttggtcatcaatgtactttacttctaatctatta 2906 

Ix| 516 NLRKRFLIK 524 

2907 ttagcag ATG GGT TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967 

525 MG S N K K . M L V S T N Q T L R P V 542 

... 2968 GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3027 

543 A S I L K H L I N E E S S G I P F N L E 562 

' 3028 GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gtaat 3088 

563 VYMKLL TFKKDLLK HRMFG 581 

3089 tatataatgcgcgattcctcattattaattttgcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3155 

582 RKKYFVRIDI 591 

3156 AAA TCC TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3215 

592 K S C Y D R I K Q D L M F R I , V K K K L 611 

3216 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAC CGA GCT 3275 

612 KDPEFVIRKYA TIHATSDRA 631 

3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gtaagtttattttttcattggaattttttaacaa 3343 

632 TKNFVSEAFSYF 643 

3344 attcttttttag TT GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3405 

644 D MV P F E K V V Q L L S M K T 659 

3406 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 3465 

660 SDTLFVDF VDYWTKSSSEI F 679 



3466 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gtataccaattgttgaattgtaataaca 3532 
680 KM LKEHLSGHIVK 692 
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3533 ctaatgaaactag ATA GGA AAT TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA 3593 

693 IGNSQYLQKVGIPQGS 708 

3594 ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA TCG 3 653 

709 ILSSFLCHFYMEDLIDEYLS 728 

3654 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA ACA 3713 

729 FTKKKGSVLLRVVDDF LF IT 748 

3714 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gtgagttgctgtcattcc 3777 

749 VN KKDAKKFLNLSLRG 764 

3778 taagttctaaccgttgaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 3840 

765 FEKHNFSTSLEKTV 778 

3841 ATA AAC TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC AAG AAA 3900 

779 INFENSNGI INN TFFNESKK 798 

3901 AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT GAT ACA TTG TTA GCA TGT 3960 

799 RMPFFGFSVNMRSLDTLLAC 818 

396 L CCT AAA ATT GAT GAA GCC TTA TTT AAC TCT ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG 4020 

819 PKIDEALFNST SVELTKHMG 838 

4021 AAA TCT TTT TTT TAC AAA ATT CTA AG gtatactgtgtaactgaataatagctgacaaataatcag A TCG 4089 

839 KSFFYKI L R S 848 

4090 AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149 

849 SLASFAQVFIDITH NSKFNS 868 

4150 TGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT ATG AGA GCA CAA GCA TAC TTA AAA 4209 

869 CCNIYRiiGYSM C MRAQAYLK 888 

4210 AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gtgagtacttattttaactaga 4274 

889 RMKDIFIPQR MFITD 903 

4275 aaagtcattaattaaccttag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA AAG TTG GCC 4339 

904 LLN VI GRK IWKKLA 917 

4340 GAA ATA TTA J3GA TAT ACG .AGT AGG CGT TTC TTG TCC TCT GCA GAA GTC AAA TG gtacgtgtc 4401 

918 E I LGYT S R R F L S SAEVKW 935 

4402 ggtctcgagacttcagcaatattgacacatcag G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA 4468 

936 LFCLGMRDGLK 946 

4469 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 4528 

947 PSFKYHPCFEQLIYQFQSLT 966 

4529 GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA CAT AGA AGA ATA 4588 

967 DLIKPLRPVLRQVLFLH R RI 986 



4589 GCT GAT TAA tgtcattttcaatttattatatacatcctttattactggtgtcttaaacaatattattactaagtata 4665 
987 AD* 989 
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4666 gctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagttttgattgacttgtct 4745 
4746 ttatccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaaca 4825 
482 6 ttaaaagtaatacatgaggctaatctcctttcatttagaataaggaaagtggttttctataatgaataatgcccgcacta 4905 

4906 atgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatatacccagtgtt 4985 

4986 gttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagc 5065 

5066 cccaggttatccatggtggccggccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcattta 5145 

5146 atgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaatagtgttaagccattattggat 5225 

522 6 tCQgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcc 5305 

5306 tcctgatttaaaggaggaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgc 5385 

5386 aaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcggagagtatctccagcggatccttgatgtcaata 5465 

5466 acttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 5544 
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1 



met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 



arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 



10 



20 



180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 
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190 



200 



thr tyr cys val arg arg tyr ala val val gin lys ala ala met 
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 

210 

gly thr. ser ala arg pro ser arg ala thr ser tyr val gin cys 
GGC ACG TCC GCA AGG CCT TCA AGA GCC ACG TCC TAC GTC CAG TGC 

220 230 

gin gly ile pro gin gly ser ile leu ser thr leu leu cys ser 
CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC 

240 

leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg 
CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT" CGG 

250 260 

arg asp gly leu leu leu arg leu val asp asp phe leu leu val 
CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG 

270 

thr pro his leu thr his ala lys thr phe leu arg thr leu val 
ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC CTG GTC 

280 290 

arg gly val pro glu tyr gly cys val val asn leu arg lys thr 
CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA 

300 

val val asn phe pro val glu asp glu ala leu gly gly thr ala 
GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GCT 

310 320 

phe val gin met pro ala his gly leu phe pro trp cys gly leu 
TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC GGC CTG 

330 

leu leu asp thr arg thr leu glu val gin ser asp tyr ser ser 
CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC TCC AGC 

340 350 

tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly 
TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGC GGC 

360 

phe lys ala gly arg asn met arg arg lys leu phe gly val leu 
TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC TTG 

370 380 

arg leu lys cys his ser leu phe leu asp leu gin val asn ser 
CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC 
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390 

leu gin thr val cys thr asn ile tyr lys ile leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 



ala tyr arg phe his ala cys val leu gin leu pro phe his gin 

GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 

420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 

CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 

CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG ,GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 

GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 

CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 

ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 

CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 

CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 

CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 

ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 



400 



410 



CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 



# • 
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AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCAGCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 

Ep pl23 ...LVVSLIRCFFYVTEQQKSYSKT... 

Sp Tezl ...FIIPILQSFFYITESSDLRNRT... 

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV... 

HsTCPl . .. YV VELLRSFF YVTETTFQKNRL . . . 

consensus FFY TE 

K 

Motif 0 phhh K hRh R 

Ep pl23 . . .KSLGF APGKLRLIPKKT--TFRPIMTFNKKIV. . . 

Sp Tezl . . .QKTTLPP A VIRLLPKKN--TFRLITNLRKRFL . . . 

Sc Est2 . . .TLSNFNHSKMRIIPKKSNNEFRII AIPCRG AD . . . 

Hs TCP1 . . . ARP ALLTSRLRFIPKPD--GLRPIVNMD YV VG. . . 

consensus R PK RI 

AF 

Motif A h hDh GY h 

Ep pl23 ...PKLFFATMDIEKCYDSVNREKLSTFLK... 

Sp Tezl ...RKKYFVRIDIKSCYDRIKQDLMFRIVK... 

Sc Est2 . . .PEL YFMKFD VKSC YD SIPRMECMRILK. . . 

Hs TCP1 ...PELYFVKVDVTGAYDTIPQDRLTEVIA...//... 

consensus F D YD 

Motif B hPQG pS hh 

Ep pl23 ...NGKFYKQTKGIPQGLCVSSILSSFYYA... 

Sp Tezl ...GNSQYLQKVGIPQGSILSSFLCHFYME... 

Sc Est2 . . .EDKC YIREDGLFQGS SLS API VDL V YD. . . 

Hs TCP1 ...RATSYVQCQGIPQGSILSTLLCSLCYG... 

consensus G QG S 

Y 

Motif C h FDDhhh 

Eppl23 ...PNVNLLMRLTDDYLLITTQENN... 

Sp Tezl . . .KKGS VLLRV VDDFLFITVNKKD . . . 

Sc Est2 . . . S QDTLILKL ADDFLII STDQQQ. . . 

HsTCPl ...RRDGLLLRLVDDFLLVTPHLTH... 

consensus DD L 

Motif D Ghh cK 

Ep pl23 ...NVSRENGFKFNMKKL... 

Sp Tezl ...LNLSLRGFEKHNFST... 

Sc Est2 ...KKLAMGGFQKYNAKA... 

HsTCPl . . .LRTL VRGVPEYGCVV. . . 

consensus G 



